Optimal Planar Range Skyline Reporting with Linear Space in External Memory

نویسندگان

  • Yufei Tao
  • Jeonghun Yoon
چکیده

Let P be a set of n points in R. Given a rectangle Q = [α1, α2] × [β1, β2], a range skyline query returns the maxima of the points in P ∩Q. An important variant is the so-called topopen queries, where Q is a 3-sided rectangle whose upper edge is grounded at y = ∞ (that is, β2 = ∞). These queries are crucial in numerous database applications. In internal memory, extensive research has been devoted to designing data structures that can answer such queries efficiently. In contrast, currently there is no clear understanding about their exact complexities in external memory. This paper presents several structures of linear size for answering the above queries with the optimal I/O cost. We show that a top-open query can be solved in O(logB n+ k/B) I/Os, where B is the block size and k is the number of points in the query result. The query cost can be made O(log logB U + k/B) when the data points lie in a U × U grid for some integer U ≥ n, and further lowered to O(1 + k/B) if U = O(n). The same efficiency also applies to 3sided queries where Q is a right-open rectangle. However, the hardness of the problem increases if Q is a leftor bottom-open 3-sided rectangle. We prove that any linear-size structure must perform Ω((n/B) + k/B) I/Os to solve such a query in the worst case, where ǫ > 0 can be an arbitrarily small constant. In fact, leftand right-open queries are just as difficult as general (4-sided) queries, for which we give a linear-size structure with query time O((n/B) + k/B). Interestingly, this indicates that 4-sided range skyline queries have exactly the same hardness as 4-sided range reporting (where the goal is to report simply the whole P ∩Q). That is, the skyline requirement does not alter the problem difficulty at all.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Shared-Constraint Range Reporting

Orthogonal range reporting is one of the classic and most fundamental data structure problems. (2,1,1) query is a 3 dimensional query with two-sided constraint on the first dimension and one sided constraint on each of the 2nd and 3rd dimension. Given a set of N points in three dimension, a particular formulation of such a (2, 1, 1) query (known as four-sided range reporting in threedimension) ...

متن کامل

I/O-Efficient Dynamic Planar Range Skyline Queries

We present the first fully dynamic worst case I/O-efficient data structures that support planar orthogonal 3-sided range skyline reporting queries in O(log2B n+ t B1− ) I/Os and updates in O(log2B n) I/Os, using O( n B1− ) blocks of space, for n input planar points, t reported points, and parameter 0 ≤ ≤ 1. We obtain the result by extending Sundar’s priority queues with attrition to support the...

متن کامل

Skyline Queries in O(1) time?

The skyline of a set P of points (SKY (P )) consists of the "best" points with respect to minimization or maximization of the attribute values. A point p dominates another point q if p is as good as q in all dimensions and it is strictly better than q in at least one dimension. In this work, we focus on the static 2-d space and provide expected performance guarantees for 3-sided Range Skyline Q...

متن کامل

Optimal halfspace range reporting in three dimensions

We give the first optimal solution to a standard problem in computational geometry: three-dimensional halfspace range reporting. We show that n points in 3-d can be stored in a linear-space data structure so that all k points inside a query halfspace can be reported in O(log n + k) time. The data structure can be built in O(n logn) expected time. The previous methods with optimal query time req...

متن کامل

Output-sensitive Skyline Algorithms in External Memory

This paper presents new results in external memory for finding the skyline (a.k.a. maxima) of N points in d-dimensional space. The state of the art uses O((N/B) log M/B(N/B)) I/Os for fixed d ≥ 3, and O((N/B) logM/B(N/B)) I/Os for d = 2, where M and B are the sizes (in words) of memory and a disk block, respectively. We give algorithms whose running time depends on the number K of points in the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1208.4511  شماره 

صفحات  -

تاریخ انتشار 2012